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(g) A method of detecting the presence of multiple language audio channels in a digital audio signal. 

(g) During recording, a multilingual indication is 
generated when at least two of the audio chan- 
nels of a digital audio signal are associated with 
a common program, and the multilingual indi- 
cation is combined with the digital audio signal. 
During reproduction, a display is activated 
when a multilingual indication indicates that at 
least two of the audio channels included in a 
digital audio signal are associated with a com- 
mon program. 
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This invention relates to the generation and reproduction of data. More particularly, but not exclusively, 
the invention relates to digital video and audio signal recording and reproducing apparatus, and, in particular, 
to control data associated with digital audio signals. 

Broadcasting of stereo programs and bilingual programs or multilingual programs such as the Euro Sports 
s channel has occurred for many years, and the number of such broadcasts is expected to increase. 

In the present broadcasting system, only stereo programs and bilingual (main language and sub language) 
programs can be distinguished. An example of a bilingual broadcast is a program in which the main language 
is Japanese and the sub language is English. Another example of a bilingual broadcast is a sports program in 
which the main language" is a commentary and the sub language" is a commentary about a team. As used 
10 herein and in the claims, "multilingual" encompasses both of the aforementioned examples, namely, audio sig- 
nals with the same content in different languages and audio signals with different contents in the same lan- 
guage. . 

In a commercial digital video cassette recorder (VCR) format that has been previously proposed, recording 
and reproducing formats of digital audio signals have been defined for both High Definition (HD) video and 
is Standard Definition (SD) video. For SD signals, a two-channel mode (SD 2 ch) and a four-channel mode (SD 
4 ch) have been defined. For HD signals, a four-channel mode (HD 4 ch) and an eight-channel mode (HD 8 
ch) have been defined. These four modes are considered in two groups. The amount of data used to represent 
a SD or HD signal in the first of the modes is twice the amount of data used to represent the signal in the second 
of the modes. 

20 The first group, in which 16-bit linear quantizing is performed at sampling frequencies of 48 kHz, 44.1 kHz, 
and 32 kHz, includes the two-channel mode of SD and the four-channel mode of HD, shown in Figs. 1 A and 
1C, respectively. 

The second group, in which 12-bit non-linear quantizing is performed at a sampling frequency of 32 kHz, 
includes the four-channel mode of SD and the eight-channel mode of HD, shown in Figs. 1 B and 1 D, respeo- 
25 tively. 

Ten tracks are used to represent one SD video frame according to the NTSC standard (525 lines/60 Hz). 
Twelve tracks are used to represent one SD video frame according to the PAL standard (525 lines/50 Hz). In 
ten tracks (60 Hz SD system) or twelve tracks (50 Hz SD system), there is also capacity for two channels of 
digital audio signals each in 16-bit mode or four channels in 12-bit mode. 
30 Twenty tracks (1125 lines/60 Hz) or twenty four tracks (1250 lines/50 Hz) are used to represent one HD 
video frame. Thus, when an HD signal is recorded, in the 1 6-bit mode, audio signals for four channels can be 
recorded, and, in the 12-bit mode, audio signals for eight channels can be recorded. 
. , - . r ^control data area AAUX for recording control data associated with the digital audio .data is provided in 
each track. To identify the allocation of audio data on a plurality of channels (audio data stored on each charv 
35 nel), a mode signal referred to as AUDIO MODE is recorded in the AAUX area Fig. 2 is a table indicating the 
meaning of the values of the AUDIO MODE data, using the following abbreviations: 



CHN: The number of audio channels represented in a group of five tracks, that is, whether the audio 

signal is recorded using 16 bits or 12 bits. 

L : Left channel of stereo signal 

*o R : Right channel of stereo signal 

M, M1, M2 : Monaural signals 

C : Center channel of three-channel stereo (3/0 stereo) or four-channel stereo (3/1 stereo) 

S : Surround channel of 4-channel stereo (3/1 stereo) 

LS : Left surround channel of 4-channel stereo (2/2 stereo) 

*5 RS: Right surround channel of 4-channel stereo (2/2 stereo) 

?: Indistinguishable 

- : No information 



AUDIO MODE data is recorded in every track. The same AUDIO MODE data is redundantly recorded in each 
of the tracks in a channel. 

When a bilingual broadcast TV program is recorded in an SD two-channel record mode in which the number 
of quantizing bits is 1 6 and the sampling frequency is 48 kHz, as shown in Fig. 3, monaural signals are recorded 
on CH1 and CH2. The AUDIO MODE of both channels, each comprising five tracks, is (0010). In Fig. 3, each 
square indicated by solid line represents audio data of five tracks. However, from the AUDIO MODE signals 
thus recorded, it cannot be determined whether the audio signals are a bilingual. 

As shown in Fig. 4, at least three possibilities exist when the AUDIO MODE is recorded as shown in Fig. 
3. The first possibility is that Japanese language is recorded on channel 1 and English language is recorded 
on channel 2. The second possibility is that general commentary is recorded on channel 1 and the specific 
commentary, such as for a team, is recorded on channel 2. The third possibility is that acting dialogue is re- 
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corded on channel 1 and background music is recorded on channel 2. 

A similar inability to distinguish whether the audio signals recorded in the available channels are related 
exists in the four channel mode of SD, the four channel mode of HD and the eight channel mode of HD. 
When Japanese stereo sounds and English stereo sounds have been recorded on a tape in a four-channel 
5 mode, until they are reproduced, the recorded sounds cannot be distinguished. A VCR that can record audio 
signals on eight channels, such as a HDVCR, can record a four-language stereo broadcasting program simul- 
taneously with the above-mentioned Euro Sports channel. However, in this case, likewise, such a recording 
format cannot be distinguished. 

In accordance with an aspect of this invention, there are provided a method of and a system for generating 
10 data, in which a digital audio signal including a plurality of audio channels is received. A multilingual indication 
is generated when at least two of the audio channels are associated with a common program, and the multi- 
lingual indication is combined with the digital audio signal. 

In accordance with another aspect of this invention, there are provided a method of and a system for re- 
producing data, in which a digital audio signal including a plurality of audio channels and a multilingual indica- 
is tion indicating whether at least two of the audio channels are associated with a common program is received. 
A display is activated when the multilingual indication indicates that at least two of the audio channels are as- 
sociated with a common program. 

A preferred embodiment of the invention disclosed herein below provides: 
a digital audio recording and reproducing system in which the aforementioned disadvantages are avoid- 
20 ed or alleviated; and 

a digital audio signal recording and reproducing apparatus that can easily determine whether digital au- 
dio signals on a plurality of channels represent audio signals with different contents or multilingual audio sig- 
nals that have related contents. 

With the preferred embodiment, a recorded multilingual program can be recognized without need to re- 
25 produce all channels thereof. A main audio signal can be automatically reproduced without need to select a 
channel. Additionally, when multilingual channels are recorded, one of the languages can be selected by the 
user. 

The invention will now be further described, by way of illustrative and non-limiting example, with reference 
to the accompanying drawings, in which corresponding parts are identified by the same reference numerals, 
30 and in which: 

Figs. 1 A-1 D are diagrams showing recording format of digital audio signals; 
Fig. 2 is a table describing AUDIO MODE data; 

Fig.3 is an exaraplaof a previously proposed two channel recording format; ,™^~.™v 

Fig. 4 is a table indicating possible usage of the channels of Fig. 3; 
35 Figs. 5A and 5B show track recording formats; 
Fig. 6 is a diagram of an ITI area; 

Fig. 7 is a more detailed diagram of the track shown in Fig. 5B; 

Rg. 8 is a diagram illustrating the hierarchical structure of the application IDs for the track areas; 
Fig. 9B shows the track area usage when AP1 = AP2 = AP3 = 00X); 
40 Rg. 10 is a diagram of the data structure of a pack; 
Rg. 11 is a diagram of the pack header hierarchy; 
Rg. 12 is a diagram illustrating the data structure of an audio area; 

Rgs. 13 and 14 are diagrams illustrating the data structure of a pre-sync block and a post-sync block, re- 
spectively; 

45 Rg. 1 5 is a diagram illustrating the data structure of one of the fourteen sync blocks in the audio data por- 
tion of the audio area of a track shown in Rg. 12; 

Rg. 16 is a schematic diagram showing the portion of AAUX arranged in the track direction; 
Rg. 17 is a diagram illustrating the data structure of a video area of a track; 

Rg. 18 is a diagram illustrating the data structure of one of the 149 sync blocks in the video data portion 
so of the video area of a track shown in Fig. 17; 

Fig. 19 is a diagram illustrating a vertical arrangement of 149 sync blocks in the video data portion of the 
video area of a track shown in Fig. 17; 

Fig. 20 is a diagram illustrating the data structure of a sub-code area of a track; 
Fig. 21 is a diagram illustrating the data structure of one of the sync blocks in the sub-code data portion 
55 of the sub-code area of a track shown in Rg. 20; 

Fig. 22 is a block diagram of a recording portion of a digital VCR in which the present invention is applied; 
Rg. 23 is a block diagram of AAUX pack data generating circuitry; 

Figs. 24A and 24B are a block diagram of a reproducing portion of a digital VCR in which the present in- 
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vention is applied; 

Fig. 24C is a block diagram of a portion of a digital VCR in which the present invention is applied; 

Fig. 25 is a block diagram of AAUX pack data reproducing circuitry; 

Fig. 26 is a table showing the data arrangement of an AAUX source pack; 

Fig. 27 is a table showing the values of the AUDIO MODE information in header byte PC2; 

Figs. 28A-28E are diagrams illustrating the speaker positions of each channel of each stereo mode; 

Figs. 29A-29D and 30A-30H are diagrams to which reference is made in explaining audio control data for 

SD and HD signals, respectively; 

Fig. 31 is a table showing combinations of the control data SM, CHN and PA recorded in the AAUX area 
of a track for various types of audio signals; 

Figs. 32A and 32B show an AAUX pack having two and three multilingual bits, respectively; 

Figs. 33-35 show examples of using a one bit, two bit and three bit multilingual flag, respectively; and 

Fig. 36 is a flowchart illustrating operation of an apparatus emibodying the invention with a multilingual 

indication. 

With reference to the accompanying drawings, a preferred embodiment of the present invention will be 
described. In the preferred embodiment, the present invention is applied to a digital VCR that compresses dig- 
ital video signals and records and reproduces the signals to and from a tape. However, the present invention 
can be applied to a recording and reproducing apparatus that uses another recording medium such as an opt- 
ical disc. Moreover, the present invention can also be applied to a system that transmits digital audio signals 
through a communication line. 

In a digital VCR that compresses digital video data, composite digital color video data is separated into a 
luminance signal Y and color difference signals (R - Y) and (B - Y). The separated signals are compressed 
using, for example, an orthogonal transformation such as a discrete cosine transformation (DCT) and variable 
length encoding. The compressed signals are recorded on a magnetic tape by a rotating head in a SD system 
(525 lines/60 Hz or 652 lines/50 Hz) or a HD system (1125 lines/60 Hz or 1250 lines/50 Hz). 

For such a digital VCR that can be used as a general-purpose recording and reproducing apparatus, the 
applicant of the present invention has proposed a data management system including an application ID. In the 
proposed system, a cassette has Video Auxiliary Data (VAUX), Audio Auxiliary Data (AAUX), sub-code data, 
and a memory in the cassette (MIC). After-recording of video data, insertion of video data, and recording of 
data (such as management signals for broadcasting stations and medical signals) superimposed in the vertical 
blanking interval are performed in a data unit referred to herein as a pack, described in detail below. 

With the application ID system, the cassette, mechanism, servo system, ITI area generating/detecting cir- 
cuit and.so forth of a home-use. digital VCR can be used in a completely different producLsuch as a data., 
streamer or a multiple-track digital tape recorder. In addition, the content of one area that has been predeter- 
mined can be defined corresponding to the application ID thereof. Depending on the value of the application 
ID, a variety of data such as video data, video data + audio data, and computer data can be designated. 

Data structures for a recording format in which the present invention may be applied will now be described 
with reference to Figs. 5-21. 

The application ID of the proposed system will now be described. 

Fig. 5A shows tracks formed obliquely on a tape for use with a d igrtaJ VCR embodying the present invention. 
Fig. 5B shows one of the tracks illustrated in Fig. 5A. At the track entering side, a timing block is formed so 
as to securely perform an after-recording operation. This timing block is referred to as an ITI (Insert and Track 
Information). The ITI is used to precisely align an area of data that is rewritten in an after-recording operation. 
The ITI should always be formed so that the track can be rewritten. 

In the ITI area, a large number of sync blocks with a short sync length are written. The sync blocks are 
assigned sync numbers successively beginning from the track entering side. In performing an after-recording 
operation, when arty sync block is detected, the position on the current track can be precisely determined. Thus, 
the area of the after-recording operation is defined corresponding to the position on the current track. Gener- 
ally, due to the mechanical accuracy of the head or the like, since the head cannot stably contact the track 
entering side, the sync length is shortened and many sync blocks are written so as to raise the detecting prob- 
ability. 

Fig. 6 is a diagram of an ITI area. As shown in Fig. 6, the ITI area includes a preamble, a Start Sync Block 
Area (SSA), a Track Information Area (T1A), and a postamble. 

Trie preamble has 1400 bits and functions as a run-in area for a phase locked loop used during reproduc- 
tion. 

The SSA comprises 61 blocks, each of which has 30 bits. 

The TIA is comprises 3 blocks having a total of 90 bits, and is used for storing information relating to the 
entire track: an Application ID of a Track (APT) of three bits, an SP/LP bit (that represents a track pitch and 
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is recorded three times for redundancy), a reserve bit, and a Pilot Frame (PF) bit that represents the reference 
frame of a servo system. 

The postamble is a margin that has 280 bits. 

A MIC cassette includes a circuit board with an integrated circuit (IC) memory. When the cassette is mount- 
5 ed in a VCR, data written in the memory IC is read. The memory IC can store Table Of Contents (TOC) infor- 
mation, index information, character information, reproduction control information, and timer record informa- 
tion as well as tape intrinsic information such as tape length, tape thickness, and tape type. Corresponding to 
the data read from the MIC, predetermined operations such as skipping of a particular program, setting of the 
reproducing order of programs, and reproducing a still image (photo image) with a particular program, and timer 
10 recording operation can be performed. 

The application ID is stored in the high order three bits of address 0 of the MIC as an APM (Application ID 
of MIC) which defines the data structure of the MIC, and is also stored in the TIA area as an APT (Application 
ID of Track) which defines the data structure of the track. The application ID defines the data structure of the 
area rather than the application. 
15 Fig. 7 is a more detailed diagram of the track shown in Fig. 5B. After the ITI area, the track is divided into 
several areas AREA 1 , AREA 2 ... AREAn. The APT indicates the data structure such as positions of the divided 
areas on the track, the structure of sync blocks, and the structure of an error correction code (ECC). In addition, 
each of the areas AREA 1 , AREA 2 .. . AREA n has an application IDAP1, AP2 ... APn, respectively, that defines 
the data structure thereof. 

20 Fig. 8 is a diagram illustrating the hierarchical structure of the application I Ds for the track areas. The APT, 
which is the primary track application ID, defines the number of areas of the track. In Fig. 8, two hierarchical 
levels are shown, but additional lower hierarchical levels may be provided. The APM, which is the application 
ID in the MIC, always indicates one hierarchical level. The value of the APT is also written to the APM by the 
digital VCR. 

25 Fig. 9A shows the track structure when APT = 000. AREA 1 , AREA 2 and AREA 3 are defined on the track. 
In addition, the positions of the areas, the structure of the sync blocks, the structure of the ECC, gaps for pro- 
tecting each area, and an overwrite margin for protecting overwrite data are defined. Each of AREA 1, AREA 

2 and AREA 3 has an application ID AP1 , AP2 and AP3, respectively, that defines the data structure thereof. 
When APT = 000, each area of the AAUX, the VAUX, the sub-code, and the MIC is written in a common pack 

30 structure. 

Fig. 9B shows the track area usage when AP1 = AP2 = AP3 = 000. When AP1 = 000, AREA 1 represents 
the data structure for audio of a consumer digital VCR (CVCR) audio auxiliary data (AAUX). When AP2 = 000, 
AREA 2 represents the data structure forvideo of CVCR video auxiliary data (VAUX). When AP3 = 000, AREA 

3 uses the data structure of sub-code of CVCR sub-code ID. In addition, the value of the APM is 000. 

35 Fig. 1 0 is a diagram of the data structure of a pack. One pack generally comprises five bytes (PC0 to PC4). 
The pack is a minimum unit of a data group. One pack is composed of related data. The first byte is a header 
and the remaining four bytes are data. However, only when character data is written to the MIC, a variable length 
pack structure is employed to more effectively use the limited buffer memory. 

Fig. 11 is a diagram of the pack header hierarchy. The high order four bits and the low order four bits of a 

40 pack header are hierarchically structured as an upper header and a lower header, respectively, comprising a 
pack header table with capacity for 256 entries. In addition, the number of levels can be increased by assigning 
bits. In this hierarchical structure, the content of the pack can be clearly structured and easily expanded. The 
pack header table is provided along with the content of each pack. With the pack header table, each area is 
written. 

45 Fig. 12 is a diagram illustrating the data structure of an audio area of a track, also referred to as an audio 
sector, and comprising a preamble, a data portion, and a postamble. The preamble has 500 bits and consists 
of a run-up area of 400 bits used for a run-up pattern for a phase locked loop and two pre-sync blocks used 
to pre-detect an audio sync block. The data portion consists of audio data of 10500 bits. The postamble has 
550 bits and consists of one post-sync block, representing the end of the audio sector with the sync number 

so of the ID, and a guard area of 500 bits used to prevent an audio sector that is after- recorded from entering 
the next video sector. 

Figs. 13 and 14 are diagrams illustrating the data structure of a pre-sync block and a post-sync block, re- 
spectively. Each pre-sync block and post-sync block has six bytes. The sixth byte of the pre-sync block is an 
SP/LP identifying byte. When the value of the SP/LP identifying byte is FFh, it represents the SP mode. When 
55 the value of the SP/LP identifying byte is OOh, it represents the LP mode. The sixth byte of the post-sync block 
is dummy data, FFh. The SP/LP identifying byte is also present in the TIA area as an SP/LP flag. The SP/LP 
identifying byte of the pre-sync block is used as a backup of the SP/LP flag in the TIA area. In other words, 
when the value of the TIA area can be read, it is used. Otherwise, the SP/LP identifying byte of the pre-sync 
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block is used. Six bytes of each of the pre-sync block and post-sync block are recorded after 24-25 conversion 
is performed. The 24-25 conversion is a modulating system in which data of 24 bytes is converted into data 
of 25 bits. Thus, the bit length of each of the pre-sync block and the post-sync block is as follows. 

Pre - sync block (6 x 2 x 8 x 25)/24 = 100 bits 

5 Post - sync block (6 x 1 x 8 x 25)/24 = 50 bits 

Fig. 15 is a diagram illustrating the data structure of one of the fourteen sync blocks in the audio data por- 
tion of the audio area of a track shown in Fig. 12, also referred to as an audio sync block. An audio sync block 
has 90 bytes. The first five bytes of the audio sync block have the same structure as those of the pre-sync 
block and the post-sync block. A data portion of an audio sync block has 77 bytes that are protected by a hor- 

w izontal parity C1 (eight bytes) and a vertical parity C2 (five sync blocks). These audio sync blocks are recorded 
after the 24-25 conversion is performed. Thus, the total bit length of the audio sync block in a track is: 

(90x14 x 8 x 25)/24 = 10500 bits 
The first five bytes of the data portion are used for AAUX and comprise one pack. Each track has nine packs. 
Numbers 0 to 8 in Fig. 15 represent pack numbers of the track. 

15 As shown in Rg. 15, the audio area of a track includes nine sync blocks providing capacity for 72 x 9 = 
648 bytes. Rve tracks have audio data capacity of 648 x 5 = 3240 bytes, which is the amount of digitized audio 
signal for one video frame. In other words, the is the amount of digital audio data on one channel every five 
tracks (60 Hz system) or six tracks (50 Hz system) in 16-bit mode. 

Fig. 16 is a schematic diagram showing the portion of AAUX arranged in the track direction. Audio data 

20 and sub-code data are recorded and reproduced as video frames. In Fig. 16, numbers 50 to 55 represent values 
of pack headers (in hexadecimal notation). As shown in Rg. 16, the same pack is redundantly written in each 
of 10 tracks for error protection. Thus, the data in the main area can be reproduced even if horizontal scratches 
and channel clogging take place in transporting the tape. The portion to which the pack headers are written 
is referred to as a main area, used for storing essential items necessary for reproducing the audio data, such 

25 as sampling frequency and quantizing bit number. 

The remaining packs are successively connected and used as an optional area. In Rg. 16, the remaining 

packs are connected in the arrow direction as in a, b, c, d, e, f, g, h skipping the packs in the main area. 

One video frame has 30 packs (525 lines/60 Hz system) or 36 packs (625 lines/50 Hz system) in the optional 
area. In the optional area, packs can be freely selected from the pack header table corresponding to each par- 

30 ticular digital VCR format 

Rg. 17 is a diagram illustrating the data structure of a video area of a track, also referred to as a video 
sector, and comprising a preamble, a video data portion of 149 sync blocks, and a postamble. The video sector 
c preamble and postamble data structures are as shpwn jriFigs. 1 3 and 14. However, the number of bits of the. 
guard area of the video sector postamble is larger than that of the audio sector postamble. 

35 Fig. 18 is a diagram illustrating the data structure of one of the 149 sync blocks in the video data portion 
of the video area of a track shown in Fig. 17, also referred to as a video sync block. As with the audio sync 
block, one video sync block has 90 bytes. The first five bytes of the video sync block have the same structure 
as those of the pre-sync block and the post-sync block. The video sync block data portion has 77 bytes that 
are protected by a horizontal parity C1 (eight bytes) and a vertical parity C2 (11 sync blocks). The video sync 

ao blocks are recorded after the 24-25 conversion is performed. The total bit length of the video sync blocks in a 
track is: 

(90x149 x 8 x 25)/24 = 111750 bits 
Rg. 19 is a diagram illustrating a vertical arrangement of 149 sync blocks in the video data portion of the 
video area of a track shown in Fig. 1 7. Two upper sync blocks and one sync block that just precedes the C2 

45 parfty are sync blocks dedicated for VAUX Data of 77 bytes is used as VALTX data. In the sync blocks other 
than the VAUX dedicated sync blocks arid the C2 sync blocks, namely, the 135 sync blocks in the middle of 
Fig. 19, DCT compressed video data is stored. Each of BUF 0 to BUF 26 represents a buffering unit One buf- 
fering unit is composed of five sync blocks. Each track has 27 buffering units. Thus, 10 tracks, which is one 
video frame, have 270 buffering units. 

so Generally, a valid image area is extracted from image data for one frame and then sampled. The resultant 
digital data is shuffled and 270 groups are collected from various portions of the real image. One of the 270 
groups is a buffering unit Each group is compressed corresponding to a compressing technique such as DCT 
technique, quantizing technique, or variable length code encoding technique in such a manner that the amount 
of data representing aD of the groups is a predetermined compressing value. Typically, a quantizing step of 

55 which the amount of generated data is the desired value or less is determined and used to encode the data. 
The generated encoded data is packed in one buffering block, which has five syncs. 

Rg. 20 is a diagram illustrating the data structure of a sub-code area of a track, also referred to as a sub- 
code sector, and comprising a preamble, a sub-code data portion , and a postamble. The sub-code sector pre- 
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amble and postamble data structures do not have a pre-sync block and a post-sync block, and so are unlike 
the audio and video sector preamble and postamble. In addition, the length of the sub-code sector is shorter 
than that of each of other sectors because the sub-code sector is used frequently for writing an index, an up 
to 200 times high speed searching operation, or the like. Furthermore, since the sub-code sector is formed at 

5 the last portion of the track, it is affected by an error that takes place at the beginning of the track. 

Fig. 21 is a diagram illustrating the data structure of one of the sync blocks in the sub-code data portion 
of the sub-code area of a track shown in Fig. 20, also referred to as a sub-code sync block. The length of the 
sub-code sync block is 12 bytes. The structure of the first five bytes of the sub-code sync block is the same 
as that of the audio-sync block and the video-sync block. The next portion of five bytes is a data portion that 

10 comprises packs. The horizontal parity C1 has two bytes and protects the data portion. Unlike with the audio 
sector and the video sector, in the sub-code sector, a product code structure using C1 and C2 is not employed. 
This is because the sub-code sector is mainly used for high speed searching operation. The C2 parity is not 
often reproduced along with the C1 parity. Each track has 1 2 sub-code sync blocks. The sub-code sync blocks 
are recorded after the 24-25 conversion is performed. The total bit length of the sub-code sync blocks in a 

15 track is: 

(12x12x8x25)724 = 1200 bits. 
A digital VCR that uses the data structures described with reference to Figs. 5-21 will now be described 
with reference to Figs. 22-25. 

Fig. 22 is a block diagram of a recording portion of a digital VCR in which the present invention is applied. 
20 In the digital VCR embodying the present invention, the digital luminance signal (Y), and the color difference 
signals (R - Y) and (B - Y) are compressed and recorded on the video sector. The digital audio signal is recorded 
on the audio sector. In addition, VAUX and AAUX are recorded in a pack structure. 

Antenna 1 of Fig. 22 receives a broadcast television signal, and supplies it to tuner 2 which demodulates 
the TV signal into a composite color video signal (corresponding to such as NTSC system or PAL system) and 
25 an audio signal. Tuner 2 supplies the composite video signal to a switch 3a, and supplies the audio signal to 
a switch 3b. 

External video input terminal 4 receives an analog composite video color video signal and supplies it to 
the switch 3a. External audio input terminal 5 receives an analog audio signal and supplies it to the switch 3b. 

The switch 3a selects one of the composite video signal received from the tuner portion 2 and the com- 
30 posite video signal received from the external video input terminal 4. The output of the switch 3a is supplied 
to a Y/C separating circuit 6 and a synchronization separating circuit 11. 

The Y/C separating circuit 6 separates a luminance signal (Y) and color difference signals (R - Y) and (B 
- Y) from the composite video signal, and supplies these signals tq low pass filters 7a, 7b, and 7c, respectively, 
which limit the band of the input signal so as to remove a loop-back distortion. The cut-off frequencies of the 
35 low pass filters 7a, 7b, and 7care 5.75 MHz, 1.45 MHz and 1.45 MHz, respectively, in the NTSC 4:1:1 system. 
The cut-off frequencies are changed if a different system is used, such as a PAL or SECAM 4:2:0 system. 

A/D converters 8a, 8b, and 8c are adapted to receive the low pass filtered signals and to sample the lu- 
minance signal (Y) at a sampling frequency of 1 3.5 MHz (4x rate), the color difference signal (R - Y) at a sam- 
pling frequency of 3.375 MHz (1x rate), and the color difference signal (B - Y) at a sampling frequency of 3.375 
40 MHz (1x rate), respectively, in accordance with dock signals supplied from phase locked loop (PLL) circuit 12 
and frequency divider 13. 

The synchronization separating circuit 11 produces a vertical synchronization signal (V sync) and a hori- 
zontal synchronization signal (H sync) and supplies these signals to PLL circuit 12 which generates a dock 
signal at the 4x rate sampling frequency 1 3.5 MHz that is locked to the input video signal and applies the 1 3.5 

45 MHz dock signal to the A/D converter 8a and to the frequency divider 1 3. The frequency divider 13 generates 
a dock signal at 1/4 of the 1 3.5 MHz frequency, that is, a Ix rate sampling dock signal at 3.375 MHz, and applies 
the Ix rate dock signal to the A/D converters 8b and 8c. 

Digital component video signals (Y), (R - Y), and (B - Y) are supplied from the A/D converters 8a, 8b, and 
8c to blocking drcuit 9 which converts the raster scanned data into blocks of eight samples x eight lines and 

50 supplies the blocked video data to shuffling circuit 1 0. The shuffling circuit 1 0 shuffles the blocks. The shuffling 
process is performed so as to prevent data recorded on the tape from being lost due to head dogging and 
horizontal scratches of the tape. In addition, the shuffling drcuit 10 changes the order of the blocks so that 
the luminance signal and the color difference signals can be more readily processed in subsequent drcuits. 
The shuffled blocks are supplied to a data compressing and encoding portion 14, comprising a compress- 

55 jng drcuit that performs DCT compression, an estimator that determines whether the data has been com- 
pressed to a predetermined level, and a quantizing device that quantizes the compressed data with a quantizing 
step corresponding to the determination by the estimator. The compressed video data is packed in a prede- 
termined sync block by a framing circuit 15 according to a predetermined rule. The output of the framing drcuit 
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15 is supplied to a combining circuit 16. 

Switch 3b selects the audio signal received from the tuner 2 or the audio signal received from the external 
audio signal input terminal 5, and supplies the selected analog audio signal to an A/D converter 21 that converts 
the selected analog audio signal into digital audio signal and applies the digital audio signal to a shuffling circuit 
5 22. The shuffling circuit 22 shuffles the digital audio data. The output of the shuffling circuit 22 is supplied to 
a framing circuit 23. The framing circuit 23 packs the audio data in an audio sync block. The output of the fram- 
ing circuit 23 is supplied to a combining circuit 24. 

Mode process microcomputer 34 that manages the modes of the VCR includes a user interface. The mode 
process microcomputer 34 operates corresponding to a field frequency of the TV image (60 Hz or 50 Hz). The 
10 mode process microcomputer 34 generates pack data of video auxiliary data VAUX, audio auxil iary data AAUX, 
and sub-code. 

Signal processing microcomputer 20 operates in synchronization with the rotation of a drum, namely, at 
9000 rpm and 150 Hz. Signal processing microcomputer 20 generates video auxiliary data (VAUX), audio aux- 
iliary data (AAUX), sub-code data, an absolute track number contained in a "title end* pack and a time title 
is code (TTC) stored in the sub-code. The video auxiliary data VAUX is supplied to a VAUX circuit 17. The audio 
auxiliary data AAUX is supplied to an AAUX circuit 19. 

Combining circuit 1 6 combines the output of a framing circuit 1 5 and the VAUX data from circuit 1 7. Com- 
bining circuit 24 combines the output of a framing circuit 23 and the AAUX data from circuit 19. The outputs 
of the combining circuits 16 and 24 are respectively supplied as VDATAand ADATA to a switch 26. 
20 A sub-code circuit 18 generates data SID of the ID portion, AP3, and sub-code pack data SDATA corre- 
sponding to the output of the signal processing microcomputer 20. The generated data are supplied to a switch 
26. Sync generating circuit 25 generates each ID portion of audio and video sync blocks including AP1 and 
AP2 for each ID portion, pre-sync, and post-sync and supplies the generated data to a switch 26. 

The switch 26 selects one of the outputs of the circuit 25, ADATA, VDATA, SID, and SDATA at a predeter- 
25 mined timing. The output of the switch circuit 26 is supplied to an error correction code generating circuit 27 
which adds a predetermined parity to the output of the switch circuit 26 and supplies the parity added infor- 
mation to a randomizing circuit 29. Trie randomizing circuit 29 randomizes the output of the error correction 
code generating circuit 27 to prevent an occurrence of a string of data having the same value (zero or one) 
and exceeding a predetermined length. The output of the randomizing circuit 29 is supplied to a 24/25 con- 
30 verting circuit 30 that converts 24-bit data into 25-bit data so as to remove a DC component for magnetic re- 
cording and reproducing operation. In addition, PRIV (partial response, class 4) coding process (1/1-D2) (not 
shown) suitable for digital recording is performed. 

The output of the 24/25 converting circuit 30 is supplied to a combining circuit 31 which combines the output 
of the 24/25 converting circuit and a sync pattern of audio, video, and sub-code. The output of the combining 
35 circuit 31 is supplied to a switch 32. 

The mode process microcomputer 34 applies data APT, SP/LP, and PF to an ITI circuit 33 which generates 
ITI sector data and supplies the ITI sector data to switch 32. 

The switch 32 selects one of the output of combining circuit 31, the ITI sector data and an amble pattern 
at a predetermined timing. The data selected by the switch 32 is supplied to a switch 35 which forwards the 
40 selected data to a head amplifier 36a or a head amplifier 36b at a head switching timing. The amplifiers 36a 
and 36b amplify the selected data and apply the amplified data to heads 37a and 37b, respectively. 

The VCR includes an external switch block 40 employed by a use to select various modes such as recording 
and reproducing. The switch block 40 includes an SP/LP record mode setting switch. The switch signals of the 
switch block 40 are supplied to a mechanical control microcomputer 28 and the signal processing microcom- 
45 puter20. 

Fig. 23 is a block diagram of AAUX pack data generating circuitry . Circuits 201 -207 may be microcomputer 
programs executed by mode process microcomputer 34. 

Main area data collecting and generating circuit 201 receives serial copy management system data (SCMS) 
from a digital bus and audio mode data, sampling frequency (SMP), number of quantizing bits (QU) and CP 
so from a tuner and generates a main area data group assembled in a bit/byte structure of a main pack. The main 
area data group includes a multilingual indication, discussed in detail below. Switch 202 adds an appropriate 
main area pack header to the main area data group and supplies the resultant data through a switch 206 to a 
parallel-to-serial (PS) converting circuit 208. 

Optional area data collecting circuit 203 receives a title of a digital audio music program of a PCM broadcast 
55 along with a title of a TV program from a tuner and generates an optional area data group. In digital audio sig- 
nals of so-called A mode and B mode received from the tuner, a sampling frequency, the number of quantizing 
bits, and so forth may be predetermined. To generate an AAUX closed caption pack (55h), a closed caption 
signal in a vertical blanking interval is received from the tuner. Audio information is extracted by a decoder 
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210. The audio information is stored in each pack of addresses (50h) and (51 h). A pack recorded in the optional 
area may differ by type of VCR. Setting circuit 204 produces an optional area pack header and supplies the 
header to switch 205 which adds it to the optional area data group and supplies the resultant data through switch 
206 to PS converting circuit 208 in accordance with timing signals provided by a timing adjusting circuit 207. 

5 The PS converting circuit 208 converts the AAUX data into serial data and supplies the serial data to a 

signal processing microcomputer 20 in a predetermined inter-microcomputer communication protocol. The sig- 
nal processing microcomputer 20 converts the serial data to parallel data and stores it in a buffer. Correspond- 
ing to a command issued from the AAUX circuit 19, the parallel data is successively read and sent to a com- 
bining circuit 24 at a predetermined timing. 

10 Rgs. 24A and 24B are a block diagram of a reproducing portion of a digital VCR in which the present in- 
vention is applied. 

Heads 101a and 101b produce reproduced data signals and supply these signals to head amplifiers 1 02a 
and 102b, respectively, which amplify the reproduced data signals and apply the amplified signals to switch 
103. The data selected by the switch 103 is supplied to an equalizer circuit 104 which performs an inverse 
15 emphasis process. When data is recorded, to improve electromagnetic converting characteristics of the tape 
and the magnetic heads, a so-called emphasis process (for example, partial response, class 4) is performed. 

The output of the equalizer circuit 104 is supplied to an A/D converter 106 and a clock extracting circuit 
105. The clock extracting circuit 105 extracts a dock signal from the reproduced signals. The output of the 
equalizer circuit 104 is digitized by an A/D converter 106 at the a timing determined by the extracted clock 
20 signal. The resultant one-bit data is written to a first in first out memory (FIFO) 107, and read out therefrom 
to a sync pattern detecting circuit 108. 

A sync pattern of each area is supplied to the sync pattern detecting circuit 108 through a switch 109. The 
switch position of the switch 109 is changed corresponding to a timing circuit 113. The sync pattern detecting 
circuit 108 has a so-called flywheel construction so as to prevent an error from taking place. In the flywheel 
25 construction, whenever a sync pattern is detected, it is determined whether or not the same sync pattern has 
been received during an interval of a predetermined length. When the determined result is yes, for example, 
three times or more, the circuit 108 detects that the sync pattern is reliable. 

When a reliable sync pattern is detected, the shift amount for forming one sync block extracted from each 
stage of the FIFO 107 is determined. Corresponding to the shift amount, required bits are supplied through a 
30 switch 110 to a sync block settlement latch 111. The sync number of the obtained sync is extracted by an ex- 
tracting circuit 112 and supplied to a timing circuit 113. Since the head position on the track corresponds to 
the sync number, the switch positions of the switches 109 and 114 are changed. 

In the case of the ITI sector, the switch 114 is placed at the lower position. Aseparating circuit 115 separates 
an ITI sync pattern. The ITI sync pattern is supplied to an ITI decoder 116. In the ITI area, encoded data is 
35 recorded. Thus, by decoding the data in the ITI area, data of APT, SP/LP, and PF can be obtained. The decoded 
data is supplied to a mode process microcomputer 117 that determines the operation mode and the like of the 
VCR and is connected to an external operation key 118. The mode process microcomputer 117 controls the 
reproducing portion of the VCR in association with a mechanical control microcomputer 128 and a signal proc- 
ess microcomputer 151. 

40 In the case of an audio or video sector or sub-code sector, the switch 114 is placed in the upper position. 
A separating circuit 122 extracts a sync pattern of each sector and supplies the extracted sync pattern to a 
derandomizing circuit 124 through a 24725 inverse converting circuit 123 so as to restore an original data se- 
quence. The restored data is supplied to an error correcting circuit 125. 

The error correcting circuit 125 detects and corrects erroneous data. When data that cannot be corrected 

45 is present, an error flag is added thereto. The output of circuit 1 25 is applied to switch 126. 

Circuit 127 processes the ID portion of the A/V sector, the pre-sync, and the post-sync The circuit 127 
extracts the sync number, track number, and SP/LP stored in each sync of pre-sync and post-sync and supplies 
the extracted data to timing circuit 113 that generates various timing signals. The circuit 127 also extracts ap- 
plication IDs AP1 and AP2 and supplies them to the mode process microcomputer 117 which determines the 

so format corresponding to AP1 and APZ When AP1 and AP2 = 000, the mode process microcomputer 117 defines 
AREA 2 (see Figs. 7 and 9B) as an image data area and assumes that normal operation will occur. Otherwise, 
the mode process microcomputer performs a warning process. 

The mode process microcomputer 117 determines SP/LP information stored in the ITI area. In a TIA area 
of the HI area, the SP/LP information is written three times. By the rule of majority, the reliability of the SP/LP 

55 information is improved. In the audio sector and video sector, there are a total of four sync blocks in which the 
SP/LP information is written. By the rule of majority, the reliability is further improved, tf the SP/LP information 
written to the ITI area does not accord with that written to the pre-sync blocks, the SP/LP information written 
to the ITI area is used with higher precedence. 
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The reproduced data of a video sector is separated into video data and VAUX data by switch 129 shown 
in Rg. 24B. The video data is supplied to a def raining circuit 130 along with the error flag. The def naming circuit 
1 30 def names the video data and supplies image data is supplied to a data decompressing and decoding por- 
tion comprising dequantizing circuit 131 and decompressing circuit 132. The circuit 131 inverse quantizes the 
5 image data from def naming circuft 130, and supplies dequantized data to circuit 132 which inverse orthogonal 
transforms the dequantized data to produce decompressed data and supplies the decompressed data to a de- 
shuffling circuit 133 that deshuffles the decompressed data and applies the deshuffled data as luminance and 
color difference data to block desegmenting circuit 134 that restores the original image sequence. 

The desegmented luminance and color difference signals are supplied to D/A converters 135a, 135b, and 
w 135c, respectively, which generate analog signals in accordance with clock signals at 13.5 MHz, 3.375 MHz 
and 3.375 MHz, respectively (for NTSC signals). 

Oscillator 139 produces the 13.5 MHz clock signal based on the output of crystal oscillator 138, and sup- 
plies the 1 3.5 MHz clock signal to frequency divider 1 40 which produces therefrom clock signals at 6.75 MHz 
and 3.375 MHz. Oscillator 139 also applies the 13.5 MHz signal to synchronization signal generating circuit 
15 141 , which generates synchronization signals such as vertical and horizontal synchronization signals accord- 
ing to the NTSC standard, and provides the synchronization signals to combining circuit 137. 

The analog luminance and color difference signals are supplied to Y/C combining circuit 136 which com- 
bines them into a combined signal, and provides the combined signal to combining circuit 137. 

The combining circuit 137 combines the combined signal and synchronization signals and supplies the re- 
20 suiting composite analog video signal to output terminal 142. 

Data reproduced from the audio sector is supplied to a switch 143 that separates the data into audio data 
and AAUX data. The audio data is supplied to a deshuffling circuit 145. The deshuffling circuit 145 restores 
the original time base of the audio data. At this point, when necessary, the audio data is interpolated in accor- 
dance with the error flag. The output of the deshuffling circuit 145 is supplied to a D/A converter 146 which 
25 restores an analog audio signal and supplies it to an output terminal 147 in synchronization with the video data. 

VAUX and AAUX data selected by the switches 129 and 143 are supplied to a VAUX circuit 148 and an 
AAUX circuit 150, respectively. The VAUX circuit 148 and the AAUX circuit 150 perform pre-processing such 
as rule-of-majority processing for multiple write situations in accordance with the error flag. The ID portion and 
the data portion of the sub-code sector are supplied to a sub-code circuit 149. The sub-code circuit 149 per- 
30 forms a pre-processing such as the rule-of-majority process in accordance with the error flag. The output of 
the sub-code circuit 149 is supplied to a signal process microcomputer 151 that performs a final reading op- 
eration. 

, Fig. 25 is a block diagram of an AAUX pack data reproducing circuit 1 50. _ 
The AAUX data received through the switch 143 is divided into main area data and optional area data by 
35 a switch 301 controlled by a write side controller 302 at a predetermined timing. 

Pack header detecting circuit 303 reads the main area pack data header and controls switch 304 in ac- 
cordance therewith. When the pack data does not contain an error, the pack data is written to a main area mem- 
ory 305 which stores, for each data word, data of eight bits and an error flag of one bit The initial setting of 
the data content for each video frame in the main area memory 305 is, for example, "1" (no information). When 
40 an error is detected, no action is performed. When no error is detected, the error-free data and error flag "0" 
are written. Since the same pack is written to the main area 10 (NTSC) or 12 (PAL) times, when one video 
frame is completed, data with error flag "1" is finally recognized as erroneous data. 

Since one pack is written to the optional area, the error flag is written to an optional area FIFO 308 along 
with the data. 

45 The contents of the memory 305 and the FIFO 308 are supplied to a signal process microcomputer 151 
through switches 306 and 307 controlled by a read timing controller 309. The signal process-microcomputer 
151 analyzes data for the main area and the optional area corresponding to the received pack data and error 
flag. After the parallel data is converted into serial data, it is supplied to a mode process microcomputer 117. 
The mode process microcomputer 117 restores parallel data, decombines pack data, and analyzes it 

so A multilingual indication in a recording format according to an embodiment of the invention will now be 
described. 

It will be recalled that Fig. 15 shows nine sync blocks 0 ... 8 in the audio data portion of the audio area of 
a track shown in Fig. 12, and that each of these nine sync blocks includes a five byte AAUX pack. As shown 
in Fig. 10, the first byte (PC0) of each pack is a header. 
55 Rg. 26 is a table showing the data arrangement of an AAUX pack for source material to be recorded when 
the header byte PC0 = 50 h. There are many data structure types of packs corresponding to the header; the 
data structure in Fig. 26 shows a multilingual bit as an embodiment of the present invention. 
The data byte PC1 includes the following information: 
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LF (one bit) : represents whether or not the video sampling frequency and the audio sampling fre- 

quency are locked. Since the video sampling frequency is not a simple multiple of the 
audio sampling frequency, the number of audio samples per frame of video samples 
must be varied for the video and audio sampling frequencies to be locked. 
5 AFSIZE (6 bits) : represents the size (the number of audio samples) of the audio frame in one video 

frame. When the video and audio sampling frequencies are locked, AFSIZE differs 
from video frame to video frame. 

The data byte PC2 includes the following information: 

SM (one bit) : represents whether a lumped audio signal is recorded (SM = "1"). The lumped au- 

to dio signal represents a HD audio signal recorded using front two audio blocks and 

rear two audio blocks and the thus recorded audio signals are intended to be output 
at the same time. 

CHN (2 bits) : represents the number of audio channels within an audio block which spans five 

or six tracks. CHN = "(XT indicates that one channel is recorded within the audio 
15 block; CHN = "01 " indicates that two channels are recorded within the audio block; 

and other values of CHN are reserved. 

PA (one bit) : represents whether two channels are reproduced at the same time (PA = "0"). 

AUDIO MODE (4 bits) : represents the order of the audio data being recorded. 

The data byte PC3 includes the following information: 
20 ML (one bit) : represents whether or not data has been recorded with multiple languages. ML = 

"0" indicates that data has been recorded with multiple languages. ML = "1" indi- 
cates that data has not been recorded with multiple languages. 

50/60 (one bit) : represents the frame frequency of the video signal. 

STYPE (five bits) : represents whether the video signal is SD or HD. 

25 The data byte PC4 includes the following information: 

EF (one bit) : represents whether emphasis is present 

TC (one bit) : represents a time constant 

SMP (three bits) : represents a sampling frequency. 

QU (three bits) : represents the number of quantizing bits. 
30 Fig. 27 is a table showing the values of the AUDIO MODE information in header byte PC2. The definitions 

for the table of Fig. 27 are similar to the definitions for the table of Fig. 2, and also include the following: 

Wo : Woofer channel 

LC: Left center channel of 8-channel stereo . 
RC: Right center channel of 8-channel stereo 
35 Figs. 28A-28E are diagrams illustrating the speaker positions of each channel of each stereo mode. The 
listener position is assumed to be at the center of each circle. 

Fig. 28 A shows 3/0 stereo mode in which three speakers U C and R are in front of the listener and there 
is no speaker behind the listener. The center speaker is positioned directly in front of the listener. The right 
and left speakers are positioned 45 degrees away from the center speaker. 
40 Rg. 28B shows 3/1 stereo mode in which three speakers of L, C and R are in front of the listener and there 

is a surround speaker S behind the listener. The surround speaker is positioned directly behind the listener. 

Fig. 28C shows 2/2 stereo mode in which two speakers U R are in front of the listener and two speakers 
LS (Left Surround) and RS (Right Surround) are positioned at the sides of the listener. 

Fig. 28D shows 3/4 + Wo stereo mode in which three speakers U C and R are in front of the listener and 
45 four speakers Lsl, Ls2, Rsl, Rs2 are positioned behind the listener. Since the human ears cannot sense the 
- direction of low frequency sound, the position of the woofer speaker does hot matter. " *" " * 

Fig. 28E shows 5/2 + Wo stereo mode in which five speakers U Lc, C, Rc, R are in front of the listener 
and two speakers Ls, Rs are behind the listener. 

Figs. 29A-29D and 30A-30H are diagrams to which reference is made in explaining audio control data in- 
so eluding AUDIO MODE, PA, SM and CHN information for SD and HD signals, respectively. 

In Figs. 29A-29D, SM = 0 in all cases because standard definition signals are illustrated. 
Rg. 29A shows the audio control data in the case that a stereo signal is recorded in stereo two-channel 
mode. Since Left audio signal and Right audio signal are outputted at the same time, PA is "0". 

Rg. 29B shows an example in the case that two monaural signals with different contents (for example, 
55 bilingual program) are recorded. Since the two monaural signals may be outputted separately, PA is "1 ". 

Fig. 29C shows an example in the case that 3/1 stereo signals are recorded in the SD four-channel mode. 
Since LR (Left and Right audio signal) and CS (Center and Surround audio signal) may be outputted at the 
same time, PA is "0". Since the two audio signals are recorded in one audio block spanning five or six tracks, 
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CHN is "01". 

Fig. 29D shows an example in the case that two independent stereo audio signals (LR and L'R') are re- 
corded. Since the two stereo audio signals are outputted separately, PA is "1". CHN is "01" because the two 
audio signals are recorded in one audio block. 

In Figs. 30A-30C, the HD 4-channel mode is illustrated. CHN = W in these cases because one audio 
signal is recorded within one audio Mock. SM = 0 in these cases because the audio signal of front two audio 
blocks and the audio signal of rear two audio blocks are outputted separately. 

Fig. 30A shows an example in the case that a stereo signal is recorded using front two audio blocks in HD 
4 channel mode. Since L (Left audio signal) and R (Right audio signal) may be outputted at the same time, PA 
is"0". 

Fig. 30B shows an example in the case that two stereo signals LR and L'R' are recorded in HD 4 channel 
mode. Since L and R of front two audio blocks are outputted at the same time, PA of front two audio blocks 
are "0". Since L' and R' of rear two audio blocks are outputted at the same time, PA of rear two audio blocks 
are "0". 

Fig. 30C shows an example in the case that a stereo signal and two monaural signals are recorded in HD 
4 channel mode. Since L and R are outputted at the same time, PA of front two channels are "0". Since two 
monaural audio signal are outputted separately, PA of rear two audio blocks are "1 ". 

In Figs. 30D and 30E, the HD lumped mode (4 ch) is illustrated. In the case of a lumped audio signal, there 
in no need to record PA so that PA = "1" (no information). SM = 1 in these cases because the audio signals 
are all outputted at the same time, that is, they are lumped. 

Fig. 30D shows an example in the case that a 3/0 stereo signal is recorded in HD 4-channel mode. 

Rg. 30E shows an example in the case that a 3/1 stereo signal is recorded in HD 4-channel mode. 

In Figs. 30F and 30G, the HD 8-channel mode is illustrated. 

Fig. 30F shows an example in the case that a pair of 3/1 stereo signal is recorded in HD 8-channel mode. 
Since U R, C and S of front two audio blocks are outputted at the same time, PA = "0* for the front two audio 
blocks. Since L\ R\ C and S' of rear two audio blocks are outputted at the same time, PA = "0" for the rear 
two audio blocks. Since the audio signals of front two blocks (LRCS) and the audio signals of the rear two 
blocks (L'R'C'S*) are not outputted at the same time, this audio signal is not a lumped audio signal and SM=0. 

Fig. 30G shows an example in the case that four sets of stereo signals are recorded in HD 8-channel mode. 
Since L1 R1 of the first audio block and L2R2 of the second audio block are outputted separately, PA = "1 " for 
the front two audio blocks. Since L3R3 of the third audio block and L4R4 of the fourth audio block are outputted 
separately, PA = "1 " for the rear two audio blocks. Since the audio signal of each audio block is outputted sep- 
arately, this is not a lumped audio signal and SM=0. 

Fig. 30H illustrates the HD lumped mode (8 ch). Fig. 30H shows an example in the case that a 5/2 stereo 
signal and woofer signal are recorded in HD 8-channel mode. Since all recorded signals are outputted at the 
same time, this is a lumped audio signal and SM="1". Since this is a lumped audio signal, PA= "1" (no infor- 
mation). 

Fig. 31 is a table showing combinations of the control data SM, CHN and PA recorded in the AAUX area 
of a track for various types of audio signals. 

It will be recalled that Fig. 26 shows an AAUX pack having one multilingual bit ML in the data byte PC3. 
The one bit multilingual flag ML is defined as follows: 
O : The same program has been multilingually recorded. 
1 : The same program has not been multilingually recorded or it is unknown. 

It will be recalled that "multilingual" indicates audio with different contents in the same language or audio with 
the same content in different languages. 

• Rg. 32A shows an AAUX pack having two multilingual bits ML2, ML1 m the data byte PC3. The two-bit 
multilingual flag ML is defined as follows: 

00 : The same program has been multilingually recorded. However, there is no other program that has 

been multilingually recorded. 
01 : The same program has been multilingually recorded. However, there is one program that has been 

multilingually recorded. 

1 0 : The same program has been multilingually recorded. However, there are two or more programs that 

have been multilingually recorded. 
11 : The same program has not been multilingually recorded or it is unknown (this definition is required 

for after-recording). 

Rg. 32B shows an AAUX pack having three multilingual bits ML2, ML1, ML0. The bits ML2, ML1 are in 
the data byte PC3. The bit ML0 is in the data byte PC1 . The three-bit multilingual flag ML is defined as follows: 
000: There is one channel on which the same program has been recorded in a different language. 



13 



EP 0 682 421 A1 



001: There are two channels on which the same program has been recorded in different languages. 
010: There are three channels on which the same program has been recorded in different languages. 
011 : There are four channels on which the same program has been recorded in different languages. 
100: There are five channels on which the same program has been recorded in different languages. 
5 101: There are six channels on which the same program has been recorded in different languages. 
110: There are seven channels on which the same program has been recorded in different languages. 
111 : The same program has not been multilingually recorded or it is unknown (this definition is required 
for after-recording). 

Figs. 33-35 show examples of using a one bit. two bit and three bit multilingual flag, respectively. 
10 Figs. 33A-33G show examples of using a one bit multilingual flag. 

Fig. 33A shows an example in which two monaural signals M1 and M2 have been recorded in an SD four- 
channel format When M1 is a music signal and M2 is a narration signal, ML = 1. 

Fig. 33B shows an example in which two monaural signals M1 and M2 have been recorded in an SD tour- 
channel format When M1 is a Japanese audio and M2 is an English audio (in the case of bilingual audio), ML 
is - 0. In this multilingual recording mode, the Japanese sound that is a main sound has been recorded on five 
tracks prior to the English sound. This relation applies to other recording examples. 

When the record channel of the main sound is allocated in the case of the multilingual recording mode, 
the channel of the main sound can be selectively reproduced. As shown in Fig. 24C, channel selector 130 for 
selecting a channel of an audio signal is disposed between the deshuffling circuit 145 of the VCR reproducing 
20 circuit and the D/A converting circuit 146 shown in Fig. 24B. The selector 180 indicates to the user whether a 
multilingual signal has been recorded by activating display 182, which may be an alphanumeric display or a 
lamp on a console or remote device. The user selects among the available channels using switch block 181 
or another appropriate data entry device. 

Thus, the user can easily determine whether or not signals have been multilingually recorded correspond- 
25 ing to the multilingual flag ML For example, when the value of the flag ML is displayed, the user can know 
whether or not the signals have been multilingually recorded and can select a desired channel. 

In the above-description, the main audio has been recorded prior to the sub audio in one video frame. How- 
ever, the present invention is not limited to such a scheme. Instead, the user can allocate the record position 
of each language (for example, the first channel for English, the second channel for French). Alternatively, both 
30 an automatic mode for recording the main audio prior to the sub audio and a manual mode for allowing the 
user to allocate the position of each language can be provided. 

Fig. 33C shows an example in which four monaural signals M1, M2, M3, M4 have been recorded in an SD 
four-channel format When Ml is a Japanese sound and M2 is an English sound, ML = 0 for M1 and M2 (chan- 
nels 1 and 2). When M3 is a classic music sound and M4 is a rock music sound, ML = 1 for M3 and M4 (channels 
35 3 and 4). 

Fig. 33D shows an example in which two stereo signals L1, R1 and L2, R2 have been recorded in an SD 
four-channel format When L1 and R1 are classical music, ML = 1 for L1 and R1. When L2 and R2 are rock 
music, ML = 1 for L2 and R2. 

Fig. 33E shows another example in which two stereo signals L1, R1 and L2, R2 have been recorded in an 
40 SD four-channel format When L1 and R1 are Japanese movie speech, ML - 0 for L1 and R1. When l_2 and 
R2 are English movie speech, ML = O for L2 and R2. The Japanese speech (main audio) has been recorded 
before the English speech (sub audb). 

Fig. 33F shows an example of an HD eight-channel format When four stereo sounds have been recorded 
and L1 and R1 are English movie speech, ML = 0 for L1 and R1. When L2 and R2 are French movie speech, 
45 ML = 0 for L2 and R2. When L3 and R3 are Dutch movie speech, ML = O for L3 and R3. When L4 and R4 are 
- German movie speech sounds; ML = 0 for L4 and R4. Thus, four-language stereo signals that have been re- 
corded can be easily determined corresponding to the flag ML 

Fig. 33G shows another example of an HD eight-channel format in which two stereo signals and four mon- 
aural signals have been recorded. When L1 and R1 are English movie speech, ML = 0 for L1 and R1 (channels 
so A and B). When L2 and R2 are classic music sounds, ML = 1 for L2 and R2 (channels C and D). In this case, 
the English speech is treated as the main audio. When M1, M2, M3 and M4 are Dutch, German, Japanese 
and Spanish speech, respectively, ML = 0 for these signals (channels E-H) 

By counting the one-bit multilingual flags ML = 0 in one video frame, the number of languages recorded 
in one video frame can be determined. Audio with a record number selected by the user can be automatical] y 
55 selected and reproduced. When the user selects the second audio, the language of the second audio is se- 
lected and reproduced. 

Figs. 34A-34D show examples of using a two bit multilingual flag in an HD eight-channel format 

Fig. 34A shows an example in which four language movie speech of English, French, Dutch and German 
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have been recorded. In this case, all the two-bit multilingual flags ML are "00" indicating that although the same 
program has been multilinguaily recorded, there is no other program that has been multilingually recorded. 

Fig. 34B shows an example of which trilingual movie speech of English, Dutch, and German and classic 
music (L2 and R2) have been recorded. In this case, the multilingual flags for L2 and R2 are "11" indicating 
5 that the same program has not been multilingually recorded or it is unknown. The other multilingual flags are 
"00". 

Fig. 34C shows an example in which bilingual movie speech of Japanese and English have been recorded 
as L1 and R1 and L2 and R2, respectively, and Japanese news program sounds and English new program 
sounds have been recorded as L3 and R3 and L4 and R4, respectively. In this case, the multilingual flags ML 
10 for all channels are "01 " indicating that although the same program has been multilingually recorded, another 
program has been multilingually recorded. 

Fig. 34D shows an example in which bilingual movie speech of Japanese and English have been recorded 
as L1 and R1 and L2 and R2, respectively; a bilingual news program in Japanese and English has been re- 
corded as M3 and M4, respectively; and bilingual comic talk in Japanese and English have been recorded as 
15 M3 and M4, respectively. The main audio has been recorded prior to the sub audio. In this case, the multilingual 
flags ML for all channels are "10" indicating that although the same program has been multilingually recorded, 
there are two or more different programs that have been multilingually recorded. 

Figs. 35A-35F show examples of using a three bit multilingual flag in an HD eight-channel format 

Fig. 35A shows an example in which four language movie speech in English, French, Dutch, and German 
20 have been recorded as stereo signals. In this case, ML = 010 for all the channels indicating that there are three 
channels on which the same program has been recorded in different languages. 

Fig. 35B is an example in which three language movie speech in English, French and German have been 
recorded as stereo signals and classic music has been recorded as 12 and R2. In this case, ML = 111 for chan- 
nels of the classic music indicating that the same program has not been multilingually recorded or it is unknown. 
25 In addition, ML = 001 for the other channels indicating that there are two channels on which the same program 
has been recorded in different languages. 

Fig. 35C shows an example in which bilingual movie speech in Japanese and English have been recorded 
as L1 and R1 and 12 and R2, respectively and bilingual news programs in Japanese and English have been 
recorded as L3 and R3 and L4 and R4, respectively. In this case, the multilingual flags ML = 000 for all the 
30 channels indicating that there is one channel on which the same program has been recorded in a different lan- 
guage. 

Fig. 35D shows an example in which bilingual movie speech in Japanese and English have been recorded 
as L1 and R1 and L2 and R2, respectively, bilingual news programs in Japanese and English have been re- 
corded as M1 and M2, respectively, and bilingual comic talk programs in Japanese and English have been re- 

35 corded as M3 and M4, respectively. The multilingual flags ML = 000 for all the channels indicating that there 
is one channel on which the same program has been recorded in a different language. 

Fig. 35E shows an example in which monaural signals M1 to M8 have been recorded on eight channels. 
Baseball broadcast program audio consisting of Japanese general commentary, Japanese specific team com- 
mentary, English general commentary, and English specific team commentary have been recorded as M1, 

40 M2, M3 and M4, respectively. The multilingual flags ML = 01 0 for these signals (channels A-D) indicating that 
there are three channels on which the same program has been recorded in different languages. News in Jap- 
anese and English is recorded as M5 and M6, while comic speech in Japanese and English is recorded as M7 
and M8. The multilingual flags ML = 000 for these signals (channels E to H) indicating that there are two chan- 
nels on which the same program has been recorded in different languages. 

45 Fig. 35F shows another example in which monaural signals M1 to M8 have been recorded on eight chan- 

nels. Multilingual movie speech have been recorded as monaural signals M1 to M8. A Japanese movie speech- 
that is a main sound has been recorded prior to other language movie speech sounds. The other language 
movie speech sounds of English, German, Dutch, French, Russian, Italian and Spanish have been recorded 
as M2 to M8, respectively. The multilingual flags ML = 110 have been set for all the eight channels indicating 

so that there are seven channels in which the same program has been recorded in different languages. 

Fig. 36 is a flowchart illustrating operation of an apparatus embodying the present invention with a mul- 
tilingual indication. The flowchart of Fig. 36 may represent a program executed by the VCR of Fig. 24. It is 
assumed that the system has advised the user of the number of unrelated programs represented in the recorded 
audio channels, and that the user has selected one of these programs in accordance with a technique which 

55 is not part of the present invention. That is, the flowchart of Fig. 36 relates to the situation where a selected 
program may or may not have multilingual channels associated therewith. 

At step 61 0, N is initialized to a value of 1 . The parameter N indicates which audio chan nel of a set of related 
audio channels is being evaluated. In a four-channel system, the maximum value of N is four (e.g., a program 
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with audio in four languages), whereas in an eight-channel system, the maximum value of N is eight (e.g., a 
program with audio in eight languages). 

At step 620, the Nth audio block is reproduced. 

At step 630, it is determined whether the ML indicator has a value of zero. If not, for example, when ML = 
5 "1", then at step 670, the multilingual display is deactivated and at step 680, the program terminates. 

If it is determined that ML = "0" at step 630, then at step 640, the multilingual display is activated so that 
the user understands that at least one other language or commentary for the program being viewed is available. 
The multilingual display may be, for example, a lamp on a console or remote device, or a message displayed 
on a television set 

10 At step 650, it is determined whether the user has selected the next channel, such as by depressing a but- 
ton on a hand-held remote control unit, or entered a specific channel number. If not, the user is assumed to 
be satisfied with the currently reproduced language or commentary and, at step 680, the program terminates. 

If it is determined that the user has selected the next channel at step 650, then at step 660, N is incremented 
and the program returns to step 620. At step 660, if the just incremented value of N exceeds the number of 

is related audio channels for the selected program, then the number of related audio channels for the selected 
program is subtracted from the just incremented value of N, thereby allowing the user to loop through the mul- 
tilingual channels for the selected program. 

According to the above disclosure , it is determined whether or not transmitted digital audio signals have 
been multilingualty recorded. In other words, the user can know the languages of recorded sounds without need 

20 to reproduce all channels thereof. When an audio signal of the main sound has been recorded on a front sound 
channel in a predetermined region for example one video frame, the main sound can be automatically and ini- 
tially reproduced without need to change channels. In addition, since the user can select channels for languag- 
es which have been multilingually recorded, audio channels to be initially reproduced can be selected. 
Although an illustrative embodiment of the present invention, and various modifications thereof, have been 

25 described in detail herein with reference to the accompanying drawings, it is to be understood that the invention 
is not limited to this precise embodiment and the described modifications, and that various changes and further 
modifications may be effected therein by one skilled in the art without departing from the scope of the invention 
as defined in the appended claims. 

30 

Claims 

1 . A method for generating data, comprising the steps of: 

receiving a digital audio signal including a plurality of audio channels; 
36 generating a multilingual indication when at least two of the audio channels are associated with a 

common program; and 

combining the multilingual indication with the digital audio signal. 

2. The method of claim 1 , wherein the multilingual indication is a data bit 

40 

3. The method of claim 1 , further comprising the step of combining the multilingual indication with other audio 
management information. 

4. The method of claim 1 , further comprising the step of recording the digital audio signal, as combined with 
45 the multilingual indication, on a recording medium. 

5. The method of claim 4, wherein the step of recording is repeated for error protection. 

6. The method of claim 1 , further comprising the step of supplying the digital audio signal, as combined with 
so the multilingual indication, to a transmission medium. 

7. A method for reproducing data, comprising the steps of. 

receiving a digital audio signal including a plurality of audio channels and a multilingual indication 
indicating whether at least two of the audio channels are associated with a common program; and 
55 activating a display when the multilingual indication indicates that at least two of the audio channels 

are associated with a common program. 



8. 



The method of claim 7, further comprising the steps of receiving an audio channel selection, and repro- 
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during the selected audio channel. 

9. The method of claim 7, further comprising the step of automatically reproducing a first of the audio chan- 
nels when at least two of the audio channels are associated with a common program. 

10. The method of claim 7, wherein the step of receiving repeatedly receives the multilingual indication, and 
further comprising the step of applying a rule-of-majority to the repeatedly received multilingual indication 
to determine an original value of the multilingual indication. 

11. An apparatus for generating data, comprising: 

means for receiving a digital audio signal including a plurality of audio channels; 
means for generating a multilingual indication when at least two of the audio channels are asso- 
ciated with a common program; and 

means for combining the multilingual indication with the digital audio signal. 

12. The apparatus of claim 11, wherein the multilingual indication is a data bit 

13. The apparatus of claim 11, further comprising means for combining the multilingual indication with other 
audio management information. 

14. The apparatus of claim 11, further comprising means for recording the digital audio signal, as combined 
with the multilingual indication, on a recording medium. 

15. The apparatus of claim 14, wherein the means for recording is operative to repeatedly record the digital 
audio signal, as combined with the multilingual indication, for error protection. 

1 6. The apparatus of claim 1 1 , further comprising means for supplying the digital audio signal, as combined 
with the multilingual indication, to a transmission medium. 

17. An apparatus for reproducing data, comprising: 

means for receiving a digital audio signal including a plurality of audio channels and a multilingual 
indication indicating whether at least two of the audio channels are associated with a common program; 
and *„...„ 

means lor activating a display when the multilingual indication indicates that at least two of the au- 
dio channels are associated with a common program. 

1 8. The apparatus of claim 17, further comprising means for receiving an audio channel selection, and means 
for reproducing the selected audio channel. 

19. The apparatus of claim 17, further comprising means for automatically reproducing a first of the audio 
channels when at least two of the audio channels are associated with a common program. 

20. The apparatus of claim 1 7, wherein the means for receiving repeatedly receives the multilingual indication, 
and further comprising means for applying a rule-of-majority to the repeatedly received multilingual indi- 
cation to determine an original value of the multilingual indication. 
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Fig. 3 
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Fig. 10 
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Fig. 12 
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Fig. 15 
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